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We consider the problem of distributed compression for correlated quantum sources. The 
classical version of this problem was solved by Slepian and Wolf, who showed that dis- 
tributed compression could take full advantage of redundancy in the local sources created 
by the presence of correlations. We show that, in general, this is not the case for quan- 
tum sources by proving a lower bound on the rate sum for irreducible sources of product 
states which is stronger than the one given by a naive application of Slepian- Wolf . Nonethe- 
less, strategies taking advantage of correlation do exist for some special classes of quantum 
sources. For example, Devetak and Winter demonstrated the existence of such a strategy 
when one of the sources is classical. Here we find optimal non-trivial strategies for a dif- 
ferent extreme, sources of Bell states. In addition, we illustrate how distributed compres- 
sion is connected to other problems in quantum information theory, including information- 
disturbance questions, entanglement distillation and quantum error correction. 
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I. INTRODUCTION 



The insights that have come from efforts to study quantum mechanics from an information- 
theoretic point of view are profound and wide-ranging, demonstrating that quantum information 
can be compressed iHQ], stabilised 0] and usefully processed |4]. Schumacher's theorem 1110,13, 
in particular, demonstrated the fungibility of quantum states by quantifying their compressibility, 
justifying the use of the qubit as the fundamental unit of quantum information. 

In this paper we consider a distributed variant of the problem posed by Schumacher. Namely, 
we suppose that a source distributes quantum states to two or more parties, who independently 
compress the states before sending them on to a receiver, who is required to be able to reconstruct 
the original inputs. Since many ideas for the design of quantum computers and other quan- 
tum information processing devices envision a network of relatively small quantum processors 
sending quantum information between nodes 0, 0], finding good protocols for distributed com- 
pression of quantum data could conceivably have important practical benefits. More generally, 
much of quantum information theory is concerned with the manipulation of data under locality 
constraints |^, so our problem connects naturally to these investigations. 

We present two main results. First, we show that, in stark contrast to the classical case, in- 
dependent encoders frequently can take relatively little advantage of the correlations present be- 
tween their states: we prove this via a bound on the achievable rate sum for sources generating 
irreducible sets of product vectors. On the other hand, it is possible to do much better for some 
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special classes of sources. We show, in particular, that for sources of Bell states, independent en- 
coders can take full advantage of correlations. The achievable rates, however, are governed by 
different formulas than in the classical case, reflecting the quantum nature of the correlations in 
the input states. 

The paper is structured as follows. Section^gives a formal definition of the distributed com- 
pression problem and shows how questions about cloning, imprinting and quantum error 
correction can be formulated in that framework. It also gives a statement of the classical theorem 
governing distributed compression due to Slepian and Wolf before summarizing previous work 
on the quantum version. Section contains the statement and proof of our tighter bound for 
irreducible sources of product states. Section |IV| finds the achievable rate region for sources gen- 
erating Bell states. Section |V| then provides some further examples, where it seems likely that the 
optimal rates lie somewhere between full utilization of correlations and no utilization at all. We 
end with a discussion and some open problems. 

We use the following conventions throughout the paper. If Sab = {Pi^vf^} is an ensem- 
ble of bipartite states then we write £a for the ensemble {pi, ^pf} of reduced states on system A. 
Sometimes we omit subscripts (or superscripts) labelling subsystems, in which case the largest 
subsystem on which the ensemble (or state) has been defined should be assumed: £ = Sab and 

= ip^^ . We identify states with their density operators and if | (/?) is a pure state vector, we use 
the notation ip = \(p){(p\ for its density operator. The function S{p) is the von Neumann entropy 
S{p) = — Tr plog p and S{£) the von Neumann entropy of the average state of the ensemble £. 
Functions like S{A\B)p and S{A : B\C)p are defined in the same way as their classical counter- 
parts: 

S{A : B\C)p = + - 5(p^^^) - (1) 

for example. x('?) is the Holevo x quantity of £ Throughout, log and exp are taken base 2. 

II. DEFINITION AND EXAMPLES 

We now give a more formal definition of the distributed compression problem. For conve- 
nience, our definition will refer to the case of two encoders, henceforth known as Alice and Bob. 
The extension to any finite number of parties is straightforward. Our receiver will be named Char- 
lie. Consider an ensemble of bipartite quantum states £ab = {Pi, on a finite-dimensional 
Hilbert space Hab = 'Ha®'Hb and the product ensemble .5®" = fe", Iv^i")^^} on n%, where 

= iii2...in, 
Pi" = PiiPi2 ■■■Pin and 
\Vi") = 1-^12) ^ ■ ■ ■ ^ \^iJ ■ 

A source provides Alice and Bob with the state \(pi^), drawn with probability pi^. Alice and Bob 
then perform their respective encoding operations Ea and Eb- These are quantum operations, 
that is, completely positive, trace-preserving (CPTP) maps, with outputs on quantum systems 
Ca and Cb of dimensions (Ia and (Ib, respectively The joint encoding operation is Ea (E> Eb 
since Alice and Bob are required to act independently. The systems Ca and Cb are then sent to 
Charlie, who performs a decoding operation D, again a CPTP map, producing the output state 
(pin = D o [Ea EB){'Pin)- We say the encoding-decoding scheme has fidelity 1 — e if 

Pi" i^i" \p>i"\^i") > 1 - e (2) 

i" 
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FIG. 1: Achievable rate region for Slepian-Wolf encoding. 

and that {Ra,Rb) is an achievable rate pair if for all 6,e > there exists an integer N such that for 
all n > iV there is an encoding-decoding scheme with fidelity 1 — e satisfying 

— log dA < Ra + 5 and — log ds < Rb + <5. (3) 

n n 

This scenario is formulated in analogy to the asymptotically lossless setting of classical block 
compression, as opposed to lossless variable-length coding. 

We remark here that we may easily allow Alice and Bob the use of prior shared randomness, 
without affecting any of our conclusions. Indeed, randomness is unnecessary, as a look at the 
fidelity criterion Eq. l|2ll shows: the fidelity is an ensemble expectation of quantities linear in the 
output state ipin . Hence the fidelity of a randomized scheme, regardless of whether it uses shared 
or private randomness, is the average of fidelities of the schemes obtained by picking particular 
instances of the random data. So, at least one of the randomness-free schemes has a fidelity at 
least as good as the randomized version. 

The classical correlated source compression problem has a beautiful solution, due to Slepian 
and Wolf 111 111 . This remarkable theorem shows that Alice and Bob can always take advantage of 
any correlations that exist between their data. 

Theorem II.l (Slepian-Wolf 1111. See also p. 407) Let Sab = {Pi,\'Pi)A\i^i)B} such that 
\{ipi\LPj)\,\{'ipi\ipj)\ £ {0,1}. Then {Ra,Rb) is an achievable rate pair if and only if 



Ra + Rb 


> 


S{AB) 


(4) 


Ra 


> 


S{A\B) 


(5) 


Rb 


> 


S{B\A). 


(6) 



The entropies here and in our subsequent theorems are taken with respect to the average state of 
the ensemble £ab- We will refer to inequalities ^4||-l|6ll as the Slepian-Wolf bounds. Note that by 
time sharing and resource wasting, achievability of the region defined by the Slepian-Wolf bounds 
follows from the achievability of just two rate points: {S{A), ^(-Bl^)) and {S{A\B), S{B)). The 
region is depicted in Figure^ 
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FIG. 2: Cloning as distributed compression. Solid lines represent noiseless quantum channels and dashed 
lines correlation in the ensemble £ab- The encoders are each given a copy of \if) while the decoder tries to 
produce the state 



It is st raig htforward to show that the Slepian-Wolf bounds hold for all sources of quantum 
states but we will see in Section IlIII that in the general case they are freqently not achiev- 

able. In fact, achievability of the Slepian-Wolf bounds appears to be a singular phenomenon. 
Nonetheless, Devetak and Winter have generalized the coding portion of the Slepian-Wolf theo- 
rem to the situation where the states given to one party, say Alice, are quantum mechanical while 
those given to the other party are classical, meaning pure and perfectly distinguishable. For such a 
source, they show that {S{A), S{B\A)) is an achievable rate pair |15]. (In section lVBl we will com- 
bine the technique they used with a type of superdense coding to develop a coding procedure 
for partially entangled states.) Whether the point {S{A\B), S{B)) is achievable in their scenario 
remains unknown. 

Example (Cloning and information-disturbance) Let's move on to a purely quantum mechanical 
scenario, in which we will be able to relate the distributed compression problem to no-cloning and 
information-disturbance ideas. Suppose that the source generates pairs \ ^)\^) G C^(8)C^ according 
to the uniform distribution over qubit states. If Alice is given a noiseless quantum channel with 
a rate of one qubit per signal state to Charlie while Bob is given no channel at all, then perfect 
reconstruction of the input by Charlie is simply cloning. This situation is illustrated in Figure |2| 
In the approximate setting, the rate pair (1,0) is achievable if and only if there exists a sequence 
of CPTP maps Dn such that 

lim / ((/?i(/?2 • • • (/?„|I?„(|9?l9?2 • • • '/'n))|V'l¥'2 • • • '/'n) ^¥'1^952 • • • d'/'n = 1- (7) 
n— >oo J 

Similarly, if we replace the uniform ensemble over states by some other ensemble 

{pi, and again do not give Bob any capacity to communicate with Charlie, then study- 

ing distributed compression is simply an information-disturbance problem. A graphical depiction 
is given in Figure |3| On the other hand, if Alice is given a full qubit's worth of capacity and Bob 
is given some capacity greater than zero but less than a full qubit, then we are in the regime of 
information-disturbance relations with prior correlation [14], since we can assume that Charlie 
receives a state of the form \ipin){Lpin \ (g) p^n for some density operator pi^ and would like to use a 
CPTP map to convert it to a state close to 
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FIG. 3: Measurement without disturbance as distributed compression. This time the encoders are given the 
states \ipi) and E^nd the decoder attempts to produce \Lpi)\^i). 



Example (Erasure codes) Our final example hints that a full theory of the distributed compression 
of entangled states may be related to the analysis of quantum error correcting codes. Consider 
the following states: 

+ 11111)) 
+ 11100)) 

+ 11010)) 

+ 10110)). (8) 

Let £ab be the uniform ensemble for the subspace they span, giving Alice the first two qubits 
and Bob the last two. The subspace is, in fact, a type of quantum error correcting code known as 
an erasure code, capable of correcting for one error at a known position 116. 171 . Thus, (2, 1) is an 
achievable rate pair: Alice sends all of her qubits while Bob throws away half of his. Meanwhile, 
the Slepian-Wolf bounds only require that Ra + Rb ^ with no conditions on Ra and Rb in- 
dividually. Whether the pair (2, 1) is optimal, then, is actually a question about the approximate 
performance of a quantum error correcting code. 



IV'oo) 


= ^(10000) 


IV'Ol) 




IV-io) 


= ^(10101) 


IV-ll) 





III. A BOUND FOR IRREDUCIBLE PRODUCT STATE SOURCES 



The case of irreducible product state ensembles provides what is perhaps the most striking 
example of the unattainability of the Slepian-Wolf conditions. A set S of state vectors is called re- 
ducible if its elements fall into two or more orthogonal subspaces. Otherwise S is called irreducible. 
For more details on the definition and some of its equivalent formulations, see ll^ and |14]. In- 
tuitively, an irreducible set of state vectors is one for which all non-trivial measurements induce 
at least some disturbance. We say that an ensemble is irreducible if the corresponding underly- 
ing set of states is. The main result of this section is a lower bound on the attainable rate sums 
Ra + Re for irreducible ensembles. We will use two results that have been proved elsewhere LISI 
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which express the fact that an irreducible ensemble which some quantum operation leaves al- 
most invariant cannot leak much quantum information to the environment of the map. These 
statements can be thought of as approximate and asymptotic formulations of the no-cloning and 
information-disturbance principles . 

Lemma III.l (Barnum et al. 1 18], Lemma 6.1) Suppose that £ = {pi,\ai)} is an irreducible ensemble 
with K states. Suppose that the states \ai) are provided in a register A with state space Ha (ind let register 
B be an ancilla with state space Hb (with Hilbert space dimensions dA and ds)', we will refer to B as the 
environment. Let 

r -.HA^nB ^HAf^HB 

\cri)A\0)B i — ' \(.i)AB 

be a unitary map such that 

Y,PiF{w^),^^B\mi\) = ^-^■ 

i 

Let {pi,pi = Tta be the environment ensemble and let 

x{{pi,pi}) = s ^YIp^p^ ~ Y^P'^'^P^^ 

be the Holevo quantity of the environment. Then if £ is kept fixed, but e, T and dB are allowed to vary, 
we have x < /(e) where the function f satisfies /(e) ^ as e ^ 0. In fact we may take /(e) = 
a-y/e + /?A/elog ^ where a and j3 are constants. 

Proposition IIL2 (Barnum et al. 1 18], Lemma 6.4) Consider, for e > 0, 

1. an ensemble £ = £i(^ . . .®£n, where each £i is an irreducible ensemble on a state space of dimension 
at most k and with at most K signal states; 

2. an encoding-decoding scheme {E,D) on £ with average fidelity > 1 — e, leaving the environment 
in a state pi-n for input state labelled by = ii .. .in- 
Then ^x{{Pi" : Pi"}) < g{^) where g is a function satisfying g{e) as e ^ 0. Hence the amount of 
information per position tends to zero as the fidelity tends to l,for large block lengths n. 

With these tools, we can prove our main result: 

Theorem IIL3 Let£AB = {pi,\fi) A\i^i) b} be an irreducible ensemble of product states. Then a necessary 
condition for the rate pair {Ra, Rb) io be achievable for £ab is that 

/i,+i;e>SM±^M±^, (10) 

Proof The basic idea is that if {Ra, Rb) fail to satisfy Eq. ilOl , then there is not enough room in the 
compressed data to absorb all the distinguishability present in the input. Some must, therefore, 
be left behind in the environments of Alice and Bob. The amount of distinguishability allowed 
there, however, is governed by Proposition lIII.2l 

Suppose that, for some 6,e > 0, Alice and Bob have a distributed encoding-decoding scheme 
{Ea^Eb, D) for blocks of size n, with ^ log dA < Ra+S, ^ logd^ < Rb+S and fidelity 1 — e. There 
exists a unitary extension of Alice's encoding operation Ea in which the output Hilbert space 



7 



factors as TIa = Ti-WA ® '^Ca' where TCwa waste and TCca represents her noiseless quantum 
channel. Thus, Ea{p) = Tr^y^ Ua{p ® \Q){Q\)U\ for some unitary Ua and fixed ancilla state |0)(0| 
on Wa- Likewise, we can factor Bob's Hilbert space as Hb = Ti-Ws ^ '^Cb write Eb{p) = 
Trws Ub{p ® moml Now, let = E 

Pin\ipi,i){ipin \ and = X]i"?'i"IV'i")(V'j"l- Then, by the 
subadditivity and unitary invariance of the von Neumann entropy, we find 

S{£T) = Sip^) = s{Ua{p^ ® \m\)u\) < Sip"^-) + (11) 

where 

S{p^^) = S{Y,P^-^^^CA[UAi\^^-){V>^-\ ® |0)(0|)[/i]) 

is the average density operator for the reduced state of Alice's waste area and where 

= S{J2Pi-T^^WA[UA(.\^ir^)M » \Om)u\]). 

If we define 

p^^ = Ttca [UAi\^i«)M ® mO\)u\] and 

pfn^ = Ttwa [UA{\Vi^)M » \0){0\)U\] , 

and note that n{RA + 6) > S{p'^^), since /o'^-* is a state on a Hilbert space of dimension at most 
2n(RA+s)^ we can then use Eq. lITT) to conclude that 

Ra + S> S{£a) - \ [x{XV^-.p7^^^) - Y.V^-S{pt^)]^ 

= s{£a) - \ [xHm-.pl'}) - Y^vv^s{p%-)]^ , (12) 

where in the last line we have used that S{p^'^) = S{pfn'^). An analogous inequality obviously 
holds for B. At this point, we have come close to isolating the distinguishability left behind in 
the Alice waste area, in the form of ^x({Pj")Pi^^})/ which goes to as n ^ oo and e ^ by 
Proposition lIII.2l But our expression also depends on the average mixedness of the channel states 
pfn"^- We can control this through a series of inequalities that follow from the properties of X/ 
however: 

x{{Pi-,pfr.''})+x{{Pi-,P%''}) > x(te",/o^^ 



> 



x{{Pi-,D{p%^(S)p%n}) 



>Si£TB)-nh{e), 

where h{e) — > as e — > 0. The three inequalities follow, in order, from the superadditivity of x 
for ensembles of product states il9l] . the Lindblad-Uhlmann monotonicity of x under quantum 
channels, and the Fannes inequality (tE]. Again using n{RA + 6) > S{p'~^^) and n(i?B + 5) > 
S{p^'^), this inequality implies that 

Y,Pi" (^(pf.-") + S(/5f„^)) <n{RA + RB- S{£ab) + 26 + h{e)) . 

This, in turn combined with Inequality iT2l and its counterpart for Rb, yields, by invoking Propo- 
sition inni 

2iRA + Rb) > Si£A) + Si£B) + S{£ab) - 45 - 25(e) - h{e), 
and we are done. □ 
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IV. OPTIMAL COMPRESSION FOR SOURCES OF BELL STATES 

The result of the previous section, that distributed compression of irreducible ensembles of 
product states generically cannot take full advantage of classical correlations, may be somewhat 
discouraging. Fortunately, this is not quite the end of the story. In this section we consider mix- 
tures of Bell states. The quantum correlations present in the ensemble allow us to use a variation 
on the hashing protocol for purifying EPR pairs [8], combined with a type of superdense coding. 
This protocol is fully efficient, in the sense that the total number of qubits communicated matches 
the Schumacher bound for the joint ensemble. We will show the following: 



Theorem IV.l Let 



£ab 



PI, |<^+) = ^(|00) + |11)) 

P2, ir) = ;^(|oo)-|ii)) 

P^^ |V'+) = ^(|01) + |10)) 

P4, ir) = ^(|oi)-|io)) 



(13) 



be an ensemble of Bell pairs, and let H = H{pi,p2,P3,p4). Then the rate pair {Ra, Rb) can be achieved 
by distributed compression if and only if 



Ra > H/2 and Rb > H/2. 



(14) 



A. Proof of achievability 

While the states in the ensemble are highly entangled, they are also mutually orthogonal. So, 
while the ensemble Sab is highly quantum mechanical from the points of view of Alice and Bob, 
it is classical from the point of view of the decoder, whose operations are not encumbered by any 
locality constraints. Our protocol makes use of this obseration in an essential way: Alice and Bob 
will perform a series of local unitary operations before sending some fraction of their Bell pairs 
to Charlie, who will then perform a measurement to establish the identity of the states he has 
received. By appropriate choices of the local operations, all the information about the input can 
be hashed into the identity of the state sent to Charlie. 

A Bell pair can be labelled by a pair of bits. We will follow the convention of Ref . in which 
the Bell pair state |0)|yi) + (— — yi) is represented by the label (j/i, 2/2)- This labelling has 
the property that given two Bell pairs described by (2/1,2/2) and (^i, 2:2), local unitary operations 
suffice to add zi or Z2 to either of yi or 7/2- For example, a bilateral CNOT can be used to implement 
the transformation 

{zi,z2),{yi,y2) ^ {zi + yi,z2),{yi,y2 + Z2). (15) 

(Note, however, that although the operation succeeds in adding Z2 to y2, there is an unavoidable 
"backaction" on 7/1.) With this convention, a sequence of n Bell pairs can be described by a 2n-bit 
string, which we shall denote by x". This string, in turn, can be considered as a concatenation of 
two strings x'-^ and that are 2m and 2{n — m) bits long, respectively, x^ will represent the bits 
that Alice and Bob send through the channel to the decoder, and x^ will represent the bits that 
are thrown away. 

We will use a protocol in which Alice and Bob share 2m random 2(n — m)-bit-long strings s{k), 
where k ranges from 1 to 2(n — m); the necessity of sharing randomness can be removed from the 
final protocol by observing that the average fidelity of the protocol is the probability expectation 
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(over the shared randomness) of the average fidelities of schemes with the value of the shared 
radomness fixed. 

The protocol is much like hashing and consists of 2m rounds of the following procedure. In the 
kth round, given the random strings above, Alice and Bob replace with x^' = x'j^ + s{k) ■ x^ 
using local operations as discussed above. The effect of these operations will be to perform 2m 
random "bit masks" on the string that is ultimately measured by Charlie, who therefore extracts 
the parity of a random subset of bits. After every two rounds 2 j and 2 j + 1 (where j ranges from 1 
to m), Alice and Bob put the Bell pair described by the bits x'^.j and aside. Finally, they send 
all m pairs to Charlie, who measures it in the Bell basis to ascertain x^^ and x^^j^^. 

We wish to determine the minimal m such that Charlie can decode the original n pairs with 
near-vanishing error probability. Consider two strings x" and y", where x" is the true initial 
string. We will evaluate the probability that x" and are different but nonetheless result in the 
same 2m decoder outcomes, i.e., the decoder cannot uniquely decode the state. Denote the event 
in which all the decoder measurements agree for x" and y"'hy E. Then 

Pr(x" / y", E) = Pr(x" / y") Pr(£;|x" / y") 

= Pr(x" / y") [Pr(x^ ^ y^) Pr(^|x^ / y"/^ _^ 

+ Pr(x^ = y^) Pr(^|x'^ = y^, x" / y")] 
= Pr(x" / y")[Pr(x^ ^ y^)!'^"" 

+ Pr(x^ = y^) Pr(^|x^ / y^, x^ = y^)], (16) 

where the last equality follows from multiplying by a factor of 1/2 for every subsequent random 
bit mask s{k) ■ x^ done by Alice and Bob. 

Now, we argue that the second term in the last equality is zero. Consider the first number j 
such that the bit x^ is not equal to y'^ . The information that actually gets sent to the decoder is 
in fact more complicated than x^ because of the random bit masks. In each case, the bit that gets 
sent is 

■^F ~ 2;^" + /(xj^, x^, x^) + y(xf , Xjl]^) 

yf = yf + /(yr,y2^,...,yr) + 5(yf,-,?/f-i), (17) 

where / takes into account the bit masks, and g takes into account the backaction due to previous 
bit masks. But since x^ = y^ for that term and x^ = y^ for < j by hypothesis, the / and g 
functions are equal, and thus x'^' / y'^' . Then Pr(£'|x'^ ^ y^ ,x^ = y^) = 0, as we wished to 
show. This yields 

Pr(x" /y",£;) < 2-2'". (18) 

Additionally, we know that a typical set of candidates for the initial sequence of size 2"(^+'^) mem- 
bers will with probability greater than 1 — 0(exp(— J^n)) contain the true initial sequence x fl?]. 
The decoding will then fail only for two reasons: the true initial sequence is outside the typical set 
or it was impossible to uniquely decode based on the measurement outcome. Therefore, 

Pr(failure) < 2"(^+^)-2" + 0(exp(-52^)). (19) 

We can see that if 2m = n{H + 25), the error probability approaches zero. The number of Bell 
pairs m that must be sent is just n times the rate at which Alice and Bob must send their qubits: 



Ra = Rb = H/2. 



(20) 
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B. Proof of optimality 

The rate pair {H/2,H/2) is also optimal: neither rate can be reduced below H/2. The total 
number of qubits sent from Alice and Bob to Charlie must be at least H by the optimality of Schu- 
macher compression. On the other hand, Alice and Bob's local density operators are independent 
of the input. Intuitively, all information about the identity of the state exists in the correlations 
between their systems. As a result, it is impossible to do better than splitting the total rate equally 
between them. For comparison's sake, observe that the Slepian-Wolf bounds in this case are 

Ra,Rb> H -1 and Ra + Rb> H. (21) 

These inequalities do not ensure Ra, Rb > H/2, so we see that even here, where it is possible to 
fully exploit the correlations, the Slepian-Wolf bounds are insufficient to describe the achievable 
rate region. On the other hand, while it is not applicable in this case. Theorem IIII.3I would have 
given the stronger bound Ra + Rb > + H) > H, which is in fact violated by our coding 
theorem. 

In order to prove optimality of the given rate pair, it is sufficient to show that Ra > H/2 re- 
gardless of the size oi Rb- In what follows, we can therefore assume that Bob noiselessly transmits 
all of his source qubits to Charlie. We can augment any high-fidelity compression scheme by a 
state preparation scheme. Imagine a state preparer, Peter, who prepares Bell states according to 
the given distribution before giving one qubit of each pair to Alice and the other to Bob. Alice and 
Bob compress these Bell states as before. If the average fidelity of the compression scheme is 1 — e, 
we can think of this augmented state-preparation/compression scheme as classical communica- 
tion from Peter to Charlie with average error probability e. The Fannes inequality [20] ensures 
that there exists a function /(e) that approaches zero as e approaches zero such that the classical 
communication rate from Peter to Charlie, measured in bits, is — /(e). 

Let us define Peter's state preparation more precisely: for each Bell state, he can prepare a 
singlet, give one of the qubits to Bob, and then act on the other qubit with an appropriate Pauli ro- 
tation before handing it to Alice. Since Bob will give all his qubits to Charlie perfectly anyway, we 
can eliminate Bob from consideration and consider an equivalent picture in which Peter initially 
shares singlets with Charlie and encodes his classical information by acting with Paulis according 
to a distribution of entropy H. In this communication channel from Peter to Charlie, Alice is the 
bottleneck: she sends qubits at rate Ra- This rate assisted by entanglement can simply be thought 
of as superdense coding; it can result in a classical transmission rate from Peter to Charlie of at 
most 2Ra- Combining this rate with our other expression for this classical transmission rate gives 

2nRA > n{H - /(e)). (22) 
Letting e ^ proves that Ra > H/2. Switching the roles of Alice and Bob completes the proof. 

V. FURTHER EXAMPLES 

In this section we present a pair of examples that are designed to illustrate the range of com- 
pression strategies available to encoders. In each case, as with the optimal Bell pair strategy, the 
key is to make make use of orthogonality in the ensemble even though it is not directly accessible 
to the encoders. 
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A. Hidden orthogonality 

Based on the results of Section |^ one might imagine that since Alice and Bob must act lo- 
cally, a system in which both Alice and Bob's ensembles are locally irreducible (and consisting of 
pure states) would suffice for Alice and Bob not to be able to take full advantage of correlations. 
However, this is not the case, as we will show in an example that demonstrates that compressing 
correlated reducible product sources can involve quite subtle strategies. This example demon- 
strates the necessity of global irreducibility in Theorem lIII.3l 

Let Sab = {1/3, \(pi)A ® \tpi)B} where 





= |o) 


(23) 




= ^/a\0) + Vl-a\l) 


(24) 


1^3) 


= |1) 


(25) 


IV'l) 


= Vl-/3|0> + v^|l> 


(26) 


IV'2) 


= |1) 


(27) 


m 


= Vl-/3|2) + v^|0) 


(28) 



and both a and (3 are assumed to be small but non-zero. This ensemble is irreducible from the 
points of view of A and B individually but is reducible for AB. That is, £a and £3 are irreducible 
hut £ab is not, since l^pi) O IV'i) -L {ipa) iV'a) and I932) O IV'2) -L |<^3) ^ {ips)- 

The encoder at A simply performs Schumacher compression at the rate S{£a) ~ H{2/3). The 
encoder at B begins by projecting onto |2) and the subspace of states orthogonal to |2), which we 
write as 12)-*-. If the outcome is |2), he sets the state to |0). This operation has the effect 1-^ iV'i) 
where li)'^) = IV'2) = IV'2) and lip'^) = |0). The effect of the operation is shown in FigureHJ The 
encoder then performs Schumacher compression on the ensemble { 1/3, | V'i ) } at rate H{2/3) + /(/?) 
where /(/?) ^ as /? ^ 0. 

The decoder first Schumacher-decompresses the outputs of Alice and Bob's channels individ- 
ually. Next he projects onto |10) and llO)-*-. Notice that 

Span(|v.i)|^i),|<^2)|V'2))c|10)^ (29) 

and that Iv^s) IV'3) = 1 10). Therefore, if the outcome is 1 10), he sets the state to \ips) IV's)- Otherwise, 
he does nothing. 

In this way, £ab can be compressed to approximately {H{2/3), H{2/3)) qubits per signal. Em- 
phasizing the need for global, not just local, irreducibility in Theorem lIII.31 we can calculate that 
for this scheme, Ra + Rb ~ 2H{2/3) ^ 1.8366. On the other hand, the lower bound from the 
theorem is 

\{S{£a) + S{£b) + S{£ab)) ~ \h{2/3) + log3 « 2.0441, (30) 

a rate which is clearly bettered by this example. 

Summarizing, Bob performs a locally dissipative operation that can only be reversed by com- 
bining his output with the output of Alice's channel. This regime, in which the ensembles are 
locally irreducible but globally reducible, seems to provide the greatest variety of effects and 
would consequently seem to be the hardest to solve in general. Indeed, the Bell state example of 
the previous section also falls into this category. These types of semi-classical strategies promise 
to frequently beat the bounds that apply to fully irreducible ensembles, but the optimal rates in 
the general case are completely unknown. 
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(a) 



ks) = |1) 



\^2) = |1) 



l^i) - |o) 



(b) 




|0> 



I^^,> = |1> 
W2) 




|o) 



FIG. 4: Hidden orthogonality: (a) depicts the ensemble states 1?/;^)} while (b) shows the ensemble after 
Bob has performed the first half of his compression operation. 



B. A hybrid strategy 

In this example, we return to the realm of orthogonal entangled states but without requiring 
that the states be maximally entangled. The compression strategy will combine ideas from the 
hidden orthogonality example of section IV A[ specifically the locally irreversible measurement, 
and the protocol for compressing Bell states in Section lTvl in which local unitary transformations 
were used to "piggyback" extra information onto the fraction of states sent to the decoder. 

Let Eab be an ensemble consisting of two orthogonal states, | ip^) and | (^1), in occurring 



with probabilities po arid pi, respectively. By a result of Walgate et al. \Tv^ we may assume without 
loss of generality that 



|99o> = ao|00)+/3o|ll) and 
= ai|01)+/3i|10), 



(31) 
(32) 



since any other ensemble will be locally equivalent to one of this type. 

As we said, the idea behind this example is to combine two different strategies. Suppose, 
given a state \Lpi) drawn from Eab? that Alice performs a projective measurement in the standard 
{|0), |1)} basis, whose outcome is First, observe that if she sends the outcome on to Charlie and 
Bob also sends his state to Charlie, then Charlie can uniquely identify i, the identity of the input 
state, (i is a function of the parity of the outcomes of local measurements in the standard basis.) 
Whenever Alice's measurement outcome is not independent of the (classical) post-measurement 
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state on Bob's system, compression of Alice's communication below the rate H{j) will be possible, 
according to the Slepian-Wolf theorem. Up to this point, the strategy is effectively classical. To 
go beyond Slepian-Wolf, given a state \(pin) drawn from i"^]^, we will have Alice measure only 
n — m states, encoding information about the outcome on the remaining m, which will be sent to 
Charlie. 

Let us estimate the rate achievable using this procedure. Denote by qj the probability that 
Alice gets outcome j, by oj^ Bob's state given that Alice has measured j and by £' the ensemble 
{qj,ujf}. Then 

= — V]pi|aip|i)(i| and = — V]pi|/3iphi)(^2|. (33) 
Qo Qi 

Alice will perform the measurement on the product register W = Ai . . . A\^r\, where \ W\ = n — m. 
The number of typical strings will be roughly exp^WlH {£')). Moreover, that set will partition 
into subsets of size roughly exp(|W^|x (<?')) (arid a low-probability remainder) for which Bob's 
density operators can be distinguished with negligible probability of error. Hence, Alice will only 
need to send \W\{H{£') — x{£')) bits. This she will do by applying unitary encodings on her 
unmeasured states. Denote by £" the ensemble of states {U 1)p'^^{W /), for a set of unitaries 
satisfying EC/i/'^/^ = // dim(A) for all states ip and p^^ = ^iPi/pf^- (For qubits, applying a 
random Pauli operator will do.) By the Holevo-Schumacher- Westmoreland (HSW) theorem |2^ 
I23I) this encoding of classical information in quantum states can achieve the communication rate 

X{£") = S{£") - ES{{U I)p^^{U^ ® I)) (34) 
= logdim(A) + S{p^) - S{p^^). (35) 

Thus, requiring that 

mxi£") = (n - m) {H{£') - x{£')) (36) 

yields a rate m/n for AUce of 

p,_ Hi£')-xi£') 

H{£')-x{£') + x{£"y ^ 

As strange as this formula looks, it is important to observe that if = 1/2 and = Pi = l/V^, 
we recover the optimal rate Ra = 1/2 from our study of the compression of Bell states. In our 
proposal, however. Bob must always send at a rate Rb = S{£b), which is not optimal in this case. 

We will now show more carefully that this procedure actually works. The argument will es- 
sentially just require patching together known results. The versions we present here are all from 
Ref. 1 13]. First, we will need the HSW theorem: 



Theorem V.l (Holevo-Schumacher-Westmoreland f22Ll23l]) Consider the ensemble £ = {pi,pi}. 
For < r < 1, A < 1, and sufficiently large n, there is some 6 < K'/ ^Jn such that the following 
holds: For a subset A of the ensemble £'^^ such that the total probability of the states in A is greater than 
or equal to r, and a classical alphabet M = {1, . . . , 2"^}, there exists a code (composed of a function f that 
maps elements of M to codestates 'yu '■= Ph® Pin ^ ^ ^^'^ observable E on the Hilbert space of the 
codewords) such that the maximum error probability (defined as maxfcjl — tr{'jkEk) '■ k e M}) is A, and 

P > x{£) - 



This, in turn, implies the code partition theorem, which we will also use: 
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Theorem V.2 Again, consider the ensemble £ = {pi, pi} and the n-block version, S®"^. For any X,S,ri > 
and for sufficiently large n, there exist m < 2"^^^^^^'^^^^^^^^ many n-block codes (as in HSW) with 
maximum error probability A and pairwise disjoint "large" codebooks Ci. |Cj| > 2"'(^(^)~^^) such that 
Pr {state from 8'^'' not in U^i Ci] < rj. 

Finally, the gentle measurement lemma will also be useful. This result ensures that if Charlie can 
ascertain Alice and Bob's states with near-zero chance of error, then he can do so without causing 
any significant disturbance. (In this lemma, || • ||i denotes the trace norm.) 

Lemma V.3 Let {pa}, a ^ Abe a family of states, and E an observable indexed by b e B. Let cf) : A ^ B 
he a map and let there &e A > such that for every a & A,l — tr (^^^'^(q) ) < A, i.e., the observable identifies 
<p{a)from pa with maximal error probability A. Then the measurement disturbs the states pa very little: for 
every a e A, \\pa - Y.beB VEbPaVEbh < VSX + A. 

According to the code partition theorem, for any X,6,r] > and sufficiently large \W\, the 
ensemble "partitions" into at most exp(| VF| )— x(<?')+3<^)) codes, each with probability 

of error at most A and containing at least exp(| VF| {x{£') — 25)) codewords, such that the probability 
of any state in not lying in any of the codes is less than ry. By the HSW theorem, for any 
X',6' > 0, Alice can find a second code based on £" with maximum error probability A' containing 
at least exp{m{x{£") — 25")) codewords. Therefore, she will be able to send the identity of the 
code from the code partition theorem this way provided 

mixin - 26') >{n-m) {H{£') - x{£') + 3<5) , (38) 

which gives the same rate we found earlier in our rough estimate. It remains to show that Charlie 
can still recover the original state once he has decoded the piggy-backed information about the 
code identity. The probability of error in identifying the code is bounded above by A'. Let D 
be the complement of W (the system sent from Alice to Charlie) and recall that the identity of 
the code, call it k, is encoded by applying a unitary operator Uk ® Ib to the state p^ . In reality, 
however, p^ is an average over input states: p^ = Pio^io. Let \'fiD^k) = {Uk ^ lB)\^iD) and 
let TjD = yJEkK^iD^jyJEki be Charlie's post-measurement state. By the gentle measurement 
lemma. 



^Pio\WiD,k-TiD,kh = \^PiD\i'^){'>''^\®'fiO,k-^PiD\i''^){^^\®'T'iD,k 



(39) 



< VSA' + A'. (40) 

Now, the total probability of error on the first n — m states is bounded above by A' + A + ry. On 
the rest, the decoding consists of applying C/^, ( gi Ip , where k' is the measured code. Noting that 
1 — F{p, a) < ^IIp — for any states p and a 1241] . we find that the average fidelity goes to one 
as A' goes to zero. Thus, the overall average fidelity goes to one as A' + A + goes to zero. 



VI. DISCUSSION 

We have studied the problem of performing distributed compression on a source of correlated 
quantum states. For some sources, namely sources of an irreducible set of product states, we find 
that it is much harder to exploit correlations in a compression protocol than would be suggested 
by the classical Slepian-Wolf theorem. We did not attempt to find a coding strategy matching 
the bound of our Theorem IIII.3I Indeed, since its first formulation in L141 , we found the lower 
bound so odd that none of us even suspected that it might be tight. (It did lead us to develop 
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some unwarranted pessimism about the problem, however. We included in an earlier preprint 
version of this paper the erroneous assertion that local Schumacher compression is optimal for 
compression of irreducible product state sources. That is true if only unitary decoding operations 
are permitted but not in general.) In any case, the very recent solution of the quantum Slepian- 
Wolf problem with free classical side-communication (25l] . which occurred roughly a year after 
initial posting of the present paper, and coding results obtained thereafter for our model without 
classical communication [26], show that the rate pair [S{£a), \ {S{£b)+S{£ab) — S{£a))) is indeed 
universally achievable. In other words, quite surprisingly, our bound of Theorem IIII .31 is tight in 
the sense that it gives the complete rate region for irreducible ensembles. 

For sources of Bell states, on the other hand, we demonstrated an optimal method of compres- 
sion based on the hashing protocol for entanglement distillation, that fully exploits the quantum 
correlations between the two encoders. Nonetheless, the optimal rate region is not captured by the 
direct quantum analog of the classical result due to Slepian and Wolf, nor by our tighter bound. 
We also provided some other examples to illustrate the types of protocols that might occur in an 
intermediate regime, where it appears possible to exploit some of the correlations between the 
local sources but not all. 

Thus, as compared to the classical version of the problem, we find a bewildering array of dif- 
ferent strategies and achievable rates that are not easily synthesized into a single formula. Finding 
such a formula and a uniform approach to the problem integrating all possible ensembles remains 
an important open problem. 

A Postscript: We note that the more recent studies (H, |3] change the model slightly: the 
source is described not by an ensemble but by a density operator. Compression has to succeed for 
all possible decompositions of that density operator into pure state ensembles, which is equiva- 
lently described by saying that the purification of the source density operator has to be preserved 
with high (entanglement) fidelity. It turns out that in this model one can show, regardless of the 
source, that Rb > \{S{£b) + S{£ab) - S{£a)), and analogously for Ra |27]. Hence, even in this 
related but different model we are rather close to understanding the full rate region. 
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